Multi-Oriented Text lines Detection and Their Skew Estimation

نویسندگان

  • U. Pal
  • S. Sinha
  • B. B. Chaudhuri
چکیده

There are some documents where text lines are not parallel to each other i.e. different text lines of a single page may have different inclinations (orientations) with the horizontal lines. To enhance the ability of document analysis system, we need text line extraction in multiple orientations. In this papers we propose a robust technique (a) to detect text lines of arbitrary orientation in a single document page, and (b) to detect skew angle of individual text line. We use here a bottom-up approach where the connected components are at first labeled. They are then clustered into word groups. Text lines of arbitrary orientation are segmented from the estimation of these word groups. From an experiment of 3045 text lines we obtained an accuracy of 97.7% by the proposed method.

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Skew angle estimation for printed and handwritten documents using the Wigner-Ville distribution

A skew estimation algorithm for printed and handwritten documents, based on the document’s horizontal projection profile and its Wigner–Ville distribution, is presented. The proposed algorithm is able to correct skew angles that range between 289 and þ898 detecting the right oriented position of the page by the alternations of the horizontal projection profile. It is able of processing successf...

متن کامل

Part-Based Skew Estimation for Mathematical Expressions

We propose a novel method for the skew estimation on text images containing mathematical expressions which can be applied to various characters layouts. Current OCR systems are not capable of recognizing skewed characters in images correctly, and hence skew correction in such images is essential for character recognition. Conventionally methods such as projection profile methods, Hough transfor...

متن کامل

Text Detection in Multi-Oriented Natural Scene Images

---------------------------------------------------------------------***--------------------------------------------------------------------Abstract With the growing number of digital multimedia libraries, the need to efficiently index, browse and retrieve multimedia information is increased. Text embedded in images and video frames can help to identify the image information (e.g. somebody's na...

متن کامل

Automatic identification and skew estimation of text lines in real scene images

A method for the automatic localization of text embedded in complex images is proposed. It permits to detect the spatial position and the skew of the text lines which are present in the scene and to return a binary representation of each text line. Strenghts of the algorithm are independece of text skew and of presence of connected text. After a preprocessing step the input image is segmented i...

متن کامل

Skew detection and text line position determination in digitized documents

-This paper proposes a computationally efficient procedure for skew detection and text line position determination in digitized documents, which is based on the cross-correlation between the pixels of vertical lines in a document. The determination of the skew angle in documents is essential in optical character recognition systems. Due to the text skew, each horizontal text line intersects a p...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

عنوان ژورنال:

دوره   شماره 

صفحات  -

تاریخ انتشار 2002